RNA sequencing data: biases and normalization

نویسندگان

  • F. Finotello
  • E. Lavezzo
  • L. Barzon
  • P. Fontana
  • A. Si-Ammour
  • S. Toppo
  • B. Di Camillo
چکیده

Motivations In recent years, RNA sequencing (RNA-seq) has rapidly become the method of choice for measuring and comparing gene transcription levels. Despite its wide application, it is now clear that this methodology is not free from biases and that a careful normalization procedure is the basis for a correct data interpretation. The most common normalization techniques account for: library size, gene or transcript length and sequence-specific biases such as GC-content effects. The aim of the present work is to investigate biases affecting RNA seq data and their effect on differential expression analysis. In order to reduce biases due to over-simplification of gene transcription models, we consider exon-based counts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using normalization to resolve RNA-Seq biases caused by amplification from minimal input.

RNA-Seq has become a widely used method to study transcriptomes, and it is now possible to perform RNA-Seq on almost any sample. Nevertheless, samples obtained from small cell populations are particularly challenging, as biases associated with low amounts of input RNA can have strong and detrimental effects on downstream analyses. Here we compare different methods to normalize RNA-Seq data obta...

متن کامل

Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments

Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in...

متن کامل

Package ‘ scran ’ April 15 , 2017

April 15, 2017 Version 1.2.2 Date 2017-01-18 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Vect...

متن کامل

Package ‘ scran ’ January 15 , 2017

January 15, 2017 Version 1.2.1 Date 2017-01-11 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Ve...

متن کامل

Package ‘ scran ’ January 31 , 2017

January 31, 2017 Version 1.2.2 Date 2017-01-18 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Ve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015